TOP: A Compiler-Based Framework for Optimizing Machine Learning Algorithms through Generalized Triangle Inequality

نویسندگان

  • Yufei Ding
  • Lin Ning
  • Hui Guang
  • Xipeng Shen
  • Madanlal Musuvathi
  • Todd Mytkowicz
چکیده

This paper describes our recent research progress on generalizing triangle inequality (TI) to optimize Machine Learning algorithms that involve either vector dot products (e.g., Neural Networks) or distance calculations (e.g., KNN, KMeans). The progress includes a new form of TI named Angular Triangular Inequality, abstractions to enable unified treatment to various ML algorithms, and TOP, a compilerbased optimizer for effectively applying TI to optimize machine learning algorithms. Experiments show that TOP is able to automatically produce optimized algorithms that either matches or outperforms manually designed algorithms, giving up to 237x speedups and 2.5X on average.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TOP: A Framework for Enabling Algorithmic Optimizations for Distance-Related Problems

Computing distances among data points is an essential part of many important algorithms in data analytics, graph analysis, and other domains. In each of these domains, developers have spent significant manual e↵ort optimizing algorithms, often through novel applications of the triangle equality, in order to minimize the number of distance computations in the algorithms. In this work, we observe...

متن کامل

Symmetrized Bregman Divergences and Metrics

While Bregman divergences [3] have been used for several machine learning problems in recent years, the facts that they are asymmetric and does not satisfy triangle inequality have been a major limitation. In this paper, we investigate the relationship between two families of symmetrized Bregman divergences and metrics, which satisfy the triangle inequality. Further, we investigate kmeans-type ...

متن کامل

Improving the Performance of Machine Learning Algorithms for Heart Disease Diagnosis by Optimizing Data and Features

Heart is one of the most important members of the body, and heart disease is the major cause of death in the world and Iran. This is why the early/on time diagnosis is one of the significant basics for preventing and reducing deaths of this disease. So far, many studies have been done on heart disease with the aim of prediction, diagnosis, and treatment. However, most of them have been mostly f...

متن کامل

Automatic Selection of Machine Learning Models for Compiler Heuristic Generation

Machine learning has shown its capabilities for an automatic generation of heuristics used by optimizing compilers. The advantages of these heuristics are that they can be easily adopted to a new environment and in some cases outperform hand-crafted compiler optimizations. However, this approach shifts the effort from manual heuristic tuning to the model selection problem of machine learning – ...

متن کامل

Scientific Flow Field Simulation of Cruciform Missiles Through the Thin Layer Navier Stokes Equations

The thin-layer Navier-Stokes equations are solved for two complete missile configurations on an IBM 3090-200 vectro-facility supercomputer. The conservation form of the three-dimensional equations, written in generalized coordinates, are finite differenced and solved on a body-fitted curvilinear grid system developed in conjunction with the flowfield solver. The numerical procedure is based on ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2018